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A MIXED EFFECTS MODEL FOR LONGITUDINAL RELATIONAL 
AND NETWORK DATA, WITH APPLICATIONS TO 
INTERNATIONAL TRADE AND CONFLICT 

By Anton H. Westveld and Peter D. Hoff 1 

University of Nevada, Las Vegas and University of Washington, Seattle 

The focus of this paper is an approach to the modeling of longi- 
tudinal social network or relational data. Such data arise from mea- 
surements on pairs of objects or actors made at regular temporal 
intervals, resulting in a social network for each point in time. In this 
article we represent the network and temporal dependencies with 
a random effects model, resulting in a stochastic process defined by 
a set of stationary covariance matrices. Our approach builds upon 
the social relations models of Warner, Kenny and Stoto [Journal of 
Personality and Social Psychology 37 (1979) 1742-1757] and Gill and 
Swartz [Canad. J. Statist. 29 (2001) 321-331] and allows for an intra- 
and inter-temporal representation of network structures. We apply 
the methodology to two longitudinal data sets: international trade 
(continuous response) and militarized interstate disputes (binary re- 
sponse) . 

1. Longitudinal network (relational) data. Radcliffe-Brown (1940) stated 
that an understanding of the "complex network of social relations" can be 
gained by measuring the relations or interactions within a set of actors. Since 
pairwise relations are the most elemental type of relationship, relational data 
which consist of measurements made on pairs of actors are ubiquitous. Our 
focus in this article is on relational data from the field of political science, 
including (1) trade between nations, and (2) militarized disputes between 
nations. For such data, we let j/j , denote the value of the measurement on 
the potentially ordered pair of actors (i,j). In this paper we refer to social 
network data or relational data as the set of measurements of relations on 
dyads for a group of actors under study. These measurements could be bi- 
nary, ordinal or continuous, as such, the methodology applies to a broad 
range of applications beyond those discussed in this paper. 
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In the case of international trade, yij is the directed level of trade from 
nation i to nation j. Since the relation is directed, yij is not necessarily 
equal to y^. Typically, social network data, directed or undirected, are rep- 
resented by a socio-matrix [Wasserman and Faust (1994)], with the ith row 
representing data for which actor i is the sender, and column j represent- 
ing data for which j is the receiver. Since the data are based on pairs of 
actors, the diagonal representing the relationships of actors with themselves 
is generally absent from the socio-matrix. 

Many researchers have worked on models for this data structure. The 
seminal work on relational data of this form was done by Warner, Kenny 
and Stoto (1979), where a method of moments estimation procedure was 
developed based upon an ANOVA style decomposition. Models of this form 
have come to be known as social relations models or models for round robin 
data. Wong's (1982) work derived maximum likelihood estimators for these 
types of models, and Gill and Swartz (2001) studied method of moments, 
maximum likelihood and Bayesian estimation procedures for the same prob- 
lem. More broadly, Li (2002) and Li and Loken (2002) developed a general 
unified theory for dyadic data which derives the social relations model and 
other similar models from principles of group symmetry and exchangeability. 

In a series of papers [Hoff, Raftery and Handcock (2002); Hoff (2003, 
2005, 2007)], the social relations model was expanded in several directions: 
(1) A latent social space was introduced to capture patterns of transitivity, 
balance and clusterability that are often exhibited in dyadic data [Wasser- 
man and Faust (1994)]; (2) A generalized linear model was developed to 
allow for a variety of data types (binary, ordinal and continuous); (3) A Baye- 
sian estimation procedure was thoroughly outlined for (1) and (2) to estimate 
the model parameters. 

However, all models mentioned thus far are for static relational data. Of- 
ten, scientific questions are concerned with the evolution of networks over 
time. For example, in the field of international relations, questions related 
to the evolution of international trade or interstate conflicts are of great 
interest [Hoff and Ward (2003); Ward and Hoff (2007); Ward, Siverson and 
Cao (2007)]. In the field of biology, an understanding of the evolution of 
interactions of biological entities under various experimental stimuli could 
provide important insights [Barabasi and Oltvar (2004)]. With such appli- 
cations in mind, this paper expands the social relations model to account 
for dependence over time. 

This article proposes a model that accounts for temporal dependence 
among all pairwise measurements of a set of actors, thus, it falls into the 
realm of longitudinal data analysis methodology. To date, there has been 
little work on models which account for both network and temporal depen- 
dencies. A notable exception is the work by Thomas Snijders and coauthors 
[Huisman and Snijders 2003; Snijders, van de Bunt and Steglich (2010); Sni- 
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jders, Koskinen and Schweinberger (2010)] which developed an actor-orien- 
ted model for network evolution that incorporated individual-level attributes. 
This approach is based on an economic model of rational choice, whereby 
individuals make unilateral changes to their networks and behaviors in order 
to maximize personal utility functions. Parameter estimates describe indi- 
vidual's utilities for various network configurations. Parameter estimation 
methods for such a model have been developed into a freely-available soft- 
ware package (http://stat.gamma.rug.nl/siena.html), which has been 
applied to a number of data sets. 

While this work has been groundbreaking, the applicability of an actor- 
oriented model may be limited to certain types of networks. As described 
by the primary developers of this approach [Snijders, Steglich and Schwein- 
berger (2007)] , such a model may not be appropriate in situations for which 
network and behavioral data might depend on unobserved latent variables. 
Additionally, the interpretation of the parameters in an actor-based choice 
model may be problematic if the data do not actually represent choices of 
the actors, but rather outcomes determined by other actors, which may be 
constrained by circumstances beyond an individual's control. In the con- 
text of trade, for example, exports from one country to another may be 
determined by forces of supply and demand beyond just the pair. In the 
context of international conflict, countries are often unwilling participants 
in militarized disputes. 

Recently, Hanneke, Fu and Xing (2010) considered a temporal exten- 
sion of the exponential random graph modeling (ERGM) framework [Frank 
and Strauss (1986); Hunter and Handcock (2006); Handcock, Raftery and 
Tantrum (2007)]. Their work is similar to that of Thomas Snijders and coau- 
thors in that it parameterizes various network configurations, however, they 
do not take an agent-based approach to the construction of the model. 

Another approach to modeling dynamic network data is discussed in Xing, 
Fu and Song (2010). Building on ideas of Erosheva, Fienberg and Lafferty 
(2004) and Airoldi et al. (2005), these authors model each actor as having 
partial memberships to several groups. Relationships between individuals 
are determined by the groups of which they are members. Such models 
often result in a concise description of the data, as the large number of 
relationships between actors are summarized by the relationships between 
a small number of groups to which the actors belong. 

In contrast to an actor-oriented utility model, ERGM, or a group-member- 
ship model, the approach we propose is more statistical, in that the main 
parameters in our model represent expectations and covariances of relational 
measurements leading to inference about network characteristics. Our reason 
for taking such an approach is that in the empirical study of international 
relations, focus is primarily on mean or regression effects and assessments of 
their statistical significance. As discussed in Ward and Hoff (2007), common 
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practice is to merge data on all pairs of countries across several years and 
base inference on ordinary least squares estimates, treating all observations 
as independent. By ignoring network and temporal dependence, such an ap- 
proach can potentially dramatically overestimate the significance of results 
and precision of estimates. One of the objectives of our methodology is to 
provide mean and regression estimates, by properly accounting for statisti- 
cal dependencies in the data. Additionally, our modeling framework is very 
flexible and extendable: Using a generalized linear model framework, it can 
accommodate continuous and ordinal relational data. This could include 
data on intensity or duration of relationships, or the number of contacts 
between two individuals. 

The next section will outline the set of possible second-order dependencies 
inherent in this data structure. Section 3 represents the dependencies with 
a mixed-effects model, and Section 4 outlines a Bayesian approach to param- 
eter estimation. Sections 5 and 6 provide in-depth data analysis examples 
involving international trade and militarized disputes, including comparisons 
to simpler modeling approaches. A discussion follows in Section 7. 

2. Dependence structure for LSR data. Figure 1 summarizes the set 
of pairwise (second order) potential dependencies for directed longitudinal 
social network data that we will consider in this article. These are the de- 
pendencies possible assuming a dependence structure in which two relational 
measurements are dependent if and only if they share a common actor. In 
the figure, three nonidentical actors k and two time points t\,t2 are used 
to illustrate the dependencies. The arrow i — —> j represents the random 
variable yijt f° r a particular relationship from actor i to actor j at time t. 

If we are to study patterns of international trade, i — —> j might represent 
the monetary value of the exports from nation % to nation j during year t. 
Based on the figure, two directed relations are potentially dependent only if 
they share a common actor, regardless of the relation's time index. In other 
words, the random variables y a ,b,ti an d y c ,d,i 2 are independent for all t if 
{a,b}D{c, d} = 0. 

To provide a description of the five dependencies depicted in Figure 1, let 
us first consider a fixed time point t\ = ti = t. Under this condition, (a) rep- 
resents the potential dependence among measurements having a common 
sender (i.e., the "row effects"). As an example of such dependence, consider 
the exports from the United States and those from Morocco in a given year. 
Due to the overall difference in trade activity of these two nations, we might 
expect the exports from Morocco to other countries to be more similar in 
magnitude to each other than to the exports from the United States to 
other countries. Similarly, (b) represents potential dependence among mea- 
surements having a common receiver (i.e., the "column effects"). Considering 
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(d) reciprocity (e) observational dependence 
Fig. 1. Second order dependencies for longitudinal directed social network data. 

the context of international trade again, some countries consume more goods 
than other countries, which could lead to within-column correlation of trade 
values. Next, (c) represents dependence between the relations sent and re- 
ceived by the same actor. For example, countries that import at a higher 
than average rate may also export at a higher rate. Next, (d) represents the 
idea of reciprocity or dependence between the directed relations of a pair of 
actors, such as between a pair of international trading partners or disputes 
between a pair of nations. 

For ti / t<i we additionally consider temporal dependence: The figure 
in (e) indicates the dependence among a pair of actors across time. 2 

3. Mixed effects model with Markov temporal dependence. We base 
our longitudinal network model on multivariate normal distributions, with 
nonzero covariances corresponding to the dependencies represented by Fig- 
ure 1. For example, we allow Cov(y a) 6 )tl ,i/ C)£ ^ 2 ) ^ if {a, b} (~1 {c,d} ^ 0. 
Otherwise, this covariance is zero. Based on this, the complete set of nonzero 
covariances are in Table 1. Such a covariance structure can be obtained via 
a mixed effects model, defined by equations (l)-(3) that follow: 



2 In the case of nondirected network data, the minimal set of dependencies are obtained 
by replacing the directed edges in Figure 1 with nondirected edges. In this case, cases (a), 
(b) and (c) essentially represent the same dependencies, as do (d) and (e). This leaves 
only two minimal dependencies. 
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Table 1 

Covariances in the longitudinal social relations model 



t\ — t-2 t\ < t-2 



(a) 
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= Cav{y iJM ,yi,k,t 2 ) = 


5l,[ti] 


£l,Iti,*a] 
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Cov(i — — 


j,k 


<2 




= Cov(yij, tl: yk,j,t 2 ) = 


£ 2 


6,[ii,i 2 ] 
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Cov(i — — 
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3,3 
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H 
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= Cov(j/i,j, tl ,2/j,fc,t 2 ) = 
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^3,[ii] 

^4,[i!] 


^3,[ti,t a ] 
^4,[ti,*a] 


(d) 


Cov(i — — 


3,3 


1-2 


>i) 


= Cav(Vi,j,t 1 ,yj,i,t 2 ) = 


Cs,[ tl ] 


?5,[ti,t 2 ] 


(e) 


Cov(i — 


3,i 


>2 


3) 


= Cav(yij, tl: yi,j,t 2 ) = 


£ 2 


^6,[ti,t a ] 



effects. This linear decomposition consists of a sending effect Si t, a receiving 
effect rj s t and a residual error term gij,t- For a fixed t, the network dependen- 
cies can be characterized by specifying covariance structures for the random 
effects in (1). 

While Figure 1 describes the structure of the network dependencies (pair- 
wise dependencies in the actor domain), it does not provide guidance about 
the structure of the temporal dependence. We accommodate temporal de- 
pendence by expanding the model to allow the random effects to be cor- 
related over time. We consider the following first order (Markov) auto- 
regressive structure for the random effects: 

(Si,t, Ti )t )' = ^sr{si,t-l,n,t-iy + £sr,t, 

(2) (9i,j,u9j,i,t)' = ®gg(9i,j,t-U9j,i,t-l) + e gg,t, 

where $ = ( ^ $ = ( ^ ^" 

\4>rs 4>r J ' 99 \4>gg 4>g 

and e ST) t and s gg ,t are independent mean-zero bivariate normal vectors with 
covariance matrices T sr and T gg : 



Is lsr\ v = ( Ig \ 9 l 2 g\ 



(3) rSr ~W 7? J' T "-\Xggl 2 g ll ) 

The resulting covariance matrix for the vector sr^ = (s^i,^!, . . . , s^Tj^t)' 
can be written as 



Cov(srj) = S s 



/ S sr (0) E sr (l) ••• £ sr (T-l)\ 

S s ,(l)' £ sr (0) ••• S sr (T-2) 



\E sr (T-l)' E ar (T-2) / ••• S sr (0) / 

where £ sr (d) depends on <1> ST ., T sr and the time lag d. The covariance matrix 
of the vector = (gij^gj&i, • • • , 9i,j,T, 9j,i,T)' has a similar block Toeplitz 
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Table 2 

Covariances based upon the stationary mixed effects model 



d=0 d>0 



(a) 


Cov(yi,3,t,yi,k,t+d) = 


^? 
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(c) 


Gav(yi,j,t,yj,k,t+d) = 


O 3 or 




C° v (yi,j,t+<i,J/j,fe,t) = 




&sr,d 


(d) 


Cov(j/i,j, t ,yij jt+ d) = 


(J s + cr. r + cr ff 




(e) 


Cav(yi,j, t ,Vj,i,t+d) = 




&gg,d H~ Csr.d ~\~ &rs,d 



structure, which we write as Cov^jjj) = £ 99 , and is made up of the blocks 
{X 99 (0), . . . , ^ gg (T — 1)}. Putting the two sources of variation together, Ta- 
ble 2 outlines the set of potentially nonzero covariances defined by the ran- 
dom effects model. The different cr's in Table 2 replace their more general 
counterparts, the £'s of Table 1. 

Note that if we were to consider a static network, the covariances given 
by £ sr (0) and S ss (0) would represent those for the social relations models as 
outlined in Warner, Kenny and Stoto (1979) and Gill and Swartz (2001). As 
such, those models are submodels of the one defined by equations (l)-(3). 

Through the use of a generalized linear model [McCullagh and Nelder 
(1989)], the mixed effects model for Gaussian longitudinal social relations 
data can be extended to analyze relations that are not appropriately modeled 
by a Gaussian distribution, such as binary responses or counts. This is done 
by using the above model to describe a linear predictor 9ij t t in a generalized 
linear model. This leads to the following formulation: 

E(yij,t\9i,j,t) = h(0ij t t), 

8i,j,t = x 'ij,tPt + s ijt + r j)t + gij,f 

Under the model, the Ui,j,ts are conditionally independent given the #i,j,t's, 
so that we have 

A-i A T 

p(y\6)=Y\ II ]^p(yid,t\ e i,j,t)p(yj,i,t\9j,i,t)- 

i=l j=i+lt=X 

The covariance structure here is approximately that of the Gaussian model 
multiplied by a factor depending on the link function h [Hoff (2005); West- 
veld (2007)], indicating that the second order dependence outlined by Fig- 
ure 1 is still captured: 

Cov(y iujlttl ,y i2d2tt2 ) ^ Cov(e iujlttl ,e i2d2M ) x h' (x' il jutl (3 tl )h' (x' i2 j2 t2 (3 t2 ). 
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4. Parameter estimation. Estimation of model parameters is most easily 
done in the context of Bayesian inference. In this section we present a general 
Markov chain Monte Carlo (MCMC) algorithm for continuous data which 
are modeled as Gaussian, and binary data which are modeled through a par- 
ticular probit formulation based on the work of Albert and Chib (1993) and 
Chib and Greenberg (1998). 

4.1. Gaussian mixed effects model. The model fully defined by equa- 
tions (l)-(3) has the following parameters O that need to be estimated: = 

{(/3 t ]t = 1, . . . ,T), (4> s ,<t)sr, 4>rs,4>r), {(f)g,4>gg), > lr ' 7ar), (lg, A flfl ), { s i,ti f"i,t] 

i = 1, . . . ,A;t = 1, . . . ,T)}. A Bayesian analysis is conducted by examining 
the joint distribution of the parameters in given the data y: 
A-l A 

p (®\y) oc Y\ II duwn (yiij]Hij] + sn + rsj^gg) 

(4) 

v ; A 

x [Jdmvn(sri|0,E sr ) x P(P)P(<S> gg )P(r gg )P($ sr )P(T sr ), 

i=i 

where "dmvn" stands for a multivariate normal density function and 

V[i,j],t = (Vij,uVj,i,t)', V[i,j] = (y'[i,j],v • • • > vfij],r) / > 

V[i,j],t = {P't x iJ,t, Pt x j,i,t}' , = ' • ' ' ^[i,j],T) ' 

sr^t = (si,t,n it )', sn = (sr' il} sr' iT )\ 

rs i>t = (r ijt ,s i:t )', rsi = {rs' ijX , rs'^ T )' . 

The first double product of equation (4) is the density of the data given the 
sender-receiver random effects, the next product is the sampling distribution 
of the random effects, and the remaining terms are the priors for the model. 
We use the following semi-conjugate priors for /3,$ sr ,$ 33 , and T ST : 

/3 = (#,..., /^)'~mvn(M^), 

((f> s ,(f> sr ,<t> rs ,(j) r y ~ mvn(M$ sr , Vf> ar )I($ sr €5), 

& g ,<P gg y~mvn(Mz gg ,V*JI($ gg eS), 

T sr ~ inverse- Wishart (v sr , S^, 1 ). 

The (^-parameters are constrained to ensure that the temporal processes for 
the sender-receiver effects and the residual error terms produce a stationary 
process 5 [Reinsel (1997)]. Such a constraint allows the fixed-effects and 
covariance parameters to represent means and variances of the observed 
data over the observed time period. For an AR(1) model, the constraint is 
satisfied if the absolute value of eigenvalues for the <&'s are less than 1. 
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A conjugate prior for the Toeplitz matrix T gg can be obtained by con- 
sidering a transformation described by Wong (1982). In order to apply this 
approach to our problem, we consider the following bivariate innovations to 
obtain independent bivariate distributions: 

(9i,j,t-,9i,i,t)' = {9i,j,U9j,i,t)' — &gg(9i,j,t-l, 9j,i,t-l) 
~ mvn(0,r 99 ). 

Now using the property of bivariate normal distributions, we can create 
two independent vectors: a iijt t = 9i,j,t + 9j,i,t and b it j tt = 9i,j,t - 9j,i,t, where 
(Hj,t ~ normal(0, a 2 ) and bijj ~ normal(0, a%). We use inverse-gamma priors 
for a 2 a and a\: a\ ~ inverse- gamma(a a , S a ), a\ ~ inverse-gamma(aft, 5b). The 
matrix T gg can be constructed as 7^ = (<7„ + e>f)/4 and X gg = (a 2 — a 2 )/ 

Based on this class of prior distributions, a Markov chain Monte Carlo 
approximation to the joint posterior distribution may be obtained via Gibbs 
sampling for the /3's and the sender-receiver effects, with a Metropolis-Has- 
tings update for <fr sr , & gg , T sr , and T gg . However, the Metropolis-Hastings 
updates are based on their full conditional distributions. For example, con- 
sider that the full conditional distribution of 3> sr is given by 

A 

(5a) P($ sr \-) ^l[dmvn{sr tA \0,^ sr {0)) 

i=l 

A T 

(5b) xj ^dmvn(sr iii |$ sr srj it _i,r sr .) 

i=lt=2 

(5c) x dmvn($ sr |M$ sr ,y<OI($ sr € S). 

If we were to ignore the first product [equation (5a)] and the stationarity 
constraint, the expression above would be proportional to a multivariate 
normal distribution. S incG most of th.6 information about *&sr 

is contained 

in equation (5b) and (5c), the full conditional distribution of <fr sr will be 
close to this multivariate normal distribution. We use this approximation to 
the full conditional distribution as a proposal distribution, but make the nec- 
essary correction in the acceptance probability via the Metropolis-Hastings 
algorithm. We use a similar Metropolis-Hastings proposal for updating <& gg - 
Further details about the MCMC algorithm, including information for up- 
dating T sr and T gg , can be found in the Appendix. 

4.2. Probit mixed effects model. In order to model data that are not 
approximately Gaussian, such as binary data, we move the Gaussian struc- 
ture to a secondary level in the hierarchical model leading to the following 
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formulation: 

yi,j,t~p(y\0ij,t), 

d iJ,t = x 'i,j,tPt + Si jt + r j)t + gi,j s t, 

where p(y\0) represents the probability distribution of the response. For 
example, a probit model for binary data can be obtained by setting p(y\6) = 
<&(9) y [l — For the probit model, we specify the covariance of the 

sender-receiver effects S sr as before based upon the parameters & sr and T sr . 
However, as noted in Albert and Chib (1993), the variance parameter a gg in 
covariance matrix T, gg is not identifiable. For ease of interpretation, we will 
set <jg g to be equal to one so that T, gg is a correlation matrix. In doing this, 
additional constraints are placed on $> gg and T gg . Consider the following 
Yule- Walker equations for a first order auto-regressive process: 

Cov( 9[i)j] , t , 9[iAt+d ) = = | ^ g{d _ ^ . fd>Q 

Since T, gg is a correlation matrix, S 55 (0) is also a correlation matrix, with 
a correlation coefficient p gg . Solving the Yule- Walker equations in terms 
of S gg (0), we have 

x g9 (o) = $ gg x 9g m gg + r gg . 

Writing this out in terms of the individual parameters results in 

1 Pgg\ = (<$>9 ^99 
Pgg 1 J \<f>gg 0g 

(Pg <Pgg 

b gg §g 

Now we solve for 7^ and ^ gg in terms of (j) g , (j) gg and p gg to get 

^g = 1 ~ $g ~ $gg ~ 2 Pgg^g^gg^ 

~1gg = Pgg ~ ^g^gg ~ Pgg$g ~ Pgg^gg- 
If we consider a Bayesian estimation algorithm, we can propose values of 4> g , 
4> gg and p gg such that T gg is a proper covariance matrix and it is guaranteed 
that Yigg will be a correlation matrix. 

The joint density of the parameters conditional on the data y is propor- 
tional to 

A-x A 

p(Q\y) oc ] _ p (y[i,j]\°li,j]) x dmvn^jjl^ 

1=1 1=1+1 

(6) 



1 


Pgg 


) 


Pgg 






' + ( 




"fgg 




Klgg 


ig 



A 

X 

i=l 



^dmvn(sr l |0,S sr ) x P((3)P(<f> gg )P(p gg )P(<f> sr )P(T sr ). 
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Because of the nonidentifiability and reparameterization of T gg discussed 
above, we impose constraints on T gg and & gg via the following priors: 

p gg ~ normal(M Pgg ,y pgg )I(-l < p gg < l)I(T gg is positive definite), 

{<i>g,<i>gg)' ~ mvn(M# 99 , V$ ss )I($ flfl G <S)I(r gg is positive definite). 

To estimate the model parameters, the MCMC algorithm presented in Sec- 
tion 4.1 is modified in two ways: (1) p gg is now explicitly updated, and (2) the 
latent response Qij t must also be updated. For most GLMs a Metropolis- 
Hastings step is required to update the latent response. However, the probit 
model allows for a Gibbs sampling procedure based upon the work of Al- 
bert and Chib (1993) and Chib and Greenberg (1998). The Gibbs sampling 
procedure for each and pair at times t = 1,...,T proceeds by 

sampling the conditional distribution for each Oijj, based on a truncated 
normal distribution: The truncation is to the left of zero if y% j t = and to 
the right of zero if yijt = 1- Further details on the MCMC algorithm can be 
found in the Appendix. 

5. International trade. In this section we apply the methodology to the 
study of yearly international trade between 58 countries from 1981-2000. 3 
A commonly used model for international trade is the gravity model [Tin- 
bergen (1962)] which, based on Newton's law of gravity, posits that the 
force of trade between two countries is proportional to the product of their 
economic "masses" divided by the distance between them (raised to some 
power). Taking logs, a formulation of the gravity model in the context of 
longitudinal trade is given by 

In Trader, t = p Q ,t + Pi,t In GDP i)t + fc,t In GDP jtt + /3 3)i In D w + e iJit , 

where Trader is the trade between two countries at time t, Djj the geo- 
graphic distance between them, and GDPj ^ and GDPjt denote their gross 
domestic products at time t. 4 

Over the past forty years the gravity model of bilateral trade has become 
a benchmark for several reasons: (1) A gravity model can typically explain 



3 A list of countries (including their three-letter ISO codes) used in this analysis can 
be found in the Appendix. Additionally, the data and some of the R code used to fit the 
model are available as supplementary material [Westveld and Hoff (2010)]. 

4 As opposed to Ward and Hoff (2007) and Westveld (2007), real values for GDP and 
the level of trade were used in this paper. The reason the other works used nominal 
values was to avoid modeling the inflation rate for out of sample prediction. An inflator 
using the CPI-A11 Urban Consumers data was calculated to set the amounts into real 
values based on the year 2000. The CPI data can be obtained from the following: http:// 
www.bls.gov/data/home.htm. Note: this CPI data is used in the BLS inflation calculator: 
http://data.bls.gov/cgi-bin/cpicalc.pl. 
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about one-half the variation in bilateral international commerce [Ward and 
Hoff (2007)]; (2) The gravity model can be derived from first principles of 
economic theory [Anderson (1979)]; (3) The linear formulation of the model 
is easy to work with empirically and readily accommodates other factors 
that might affect trade flows. 

Following Ward and Hoff (2007), we will consider two other factors for 
this analysis: the polity of a nation and whether pairs of nations cooperated 
in militarized interstate disputes. Polity, denoted by Pol, measures a nation's 
level of democracy, and ranges from for highly authoritarian regimes to 20 
for highly democratic ones. Cooperation in conflict, denoted by CC, mea- 
sures active military cooperation. If the pair cooperated on a particular 
dispute, it receives a value of +1. However, if the two countries were on 
opposite sides of a dispute, a value of —1 is recorded. If there was more 
than one dispute in a single year involving the same pair, then the pair's 
scores are summed over all disputes in that year. It should be noted that 
all of the covariates except distance are changing over time. 5 This leads to 
the following model, which is motivated by the gravity model, additional 
covariates of interest and the longitudinal network structure: 

In Trade, t = /3 ,t + 0i,t In GDP M + /3 2ji In GDP,- 1 + In B itjtt 

+ /3 4 ,tPol M + /9 5 ,tPolj,i + Pa,tCC w + /57,tPol M x Pol i|t 

+ Si,t + r jtt + 9i,j,t 

with the following diffuse priors: 

ft ~ mvn(0, 100 x I), 

(0„ sr , rs , (f) r )' ~ mvn(0, 100 x I)I($ sr 6 S), 

(Mgg)' ~ mvn (°> 100 x I)I(*fl fl e 5 )> 
r sr ~ inverse-Wishart(4,I), 

a 2 a ~ inverse-gamma(l, 1), 

a\ ~ inverse-gamma(l, 1). 

Initially we implemented the MCMC algorithm outlined in Section 4.1, how- 
ever, we found the Markov chain to be very "sticky." This result may have 
occurred since the semi-conjugate Gibbs proposals are similar to an inde- 
pendence proposal. In this case the distribution of the proposal should be 
close to the respective posterior distribution but should be "fatter" in the 
tails to prevent "stickiness" [Givens and Hoeting (2005)]. This would suggest 



5 For further discussion of the data used in this paper, we refer the reader to Ward and 
Hoff (2007). 
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that we should increase the variance of the semi-conjugate Gibbs proposals 
to increase the rate of mixing. However, the posterior distribution of <3? sr is 
near the boundary for stationary processes, and increasing the variance of 
the proposals may lead to more unaccepted proposed values. Therefore, to 
safeguard against poor mixing of the chain, we randomly alternated between 
using (1) semi-conjugate Gibbs proposals (without an increased variance), 
and (2) random walk proposals around the current values of the parameters 
(<J> S7 -, S sr , § gg , S ss ). A Markov chain of 55,000 iterations was generated, the 
first 10,000 of which were dropped to allow convergence to the stationary 
distribution. Parameter values were saved every 20th scan, resulting in 2,250 
samples with which to approximate the joint posterior distribution. 

5.1. Results. The 95% posterior credible intervals (blue bars) and their 
medians (black dots) for the (3's are in Figure 2. Let us first consider the 
panels on the top row, excluding the intercept. The posterior distributions 
of the coefficients in the gravity model have several features: (1) In general, 
the credible intervals of the coefficients for the In GDP of the exporter are 
shifting downward over the period. Additionally, these intervals contain zero 
from 1994 to 2000, heuristically suggesting that this covariate is becoming 
a less important correlate of bilateral trade flows. (2) The coefficients for the 
In GDP of the importer over the period are all positive, suggesting that the 
economic size of the importer is an important factor in bilateral trade flows. 
(3) As might be expected, over the twenty-year period the medians of the 
coefficients for distance are generally decreasing. An intuitive explanation 
is that the transportation of goods and services has become more efficient 
over the period. 

The four panels on the bottom row of Figure 2 are the results for the 
additional predictors of trade beyond the gravity model. There appears to 
be a general decline in the coefficients for the main effects of polity of the 
exporter and importer over the period, with a notable exception for the lat- 
ter in the year 2000 (the 95% credible interval still contains zero). However, 
there appears to be a rising trend in the coefficients of polity interaction 
(07, t) over the period. The trend suggests that trade between democratic 
countries is increasing faster than the average. Finally, for the polity co- 
efficients in general we see that our estimate is becoming more uncertain 
over time, as the credible intervals are widening over the period. A plausi- 
ble explanation for this phenomenon is that the countries under study are 
becoming more democratic, thus, there is less variation in the polity covari- 
ate. The sample mean and variance of the polity score for 1980 are 3.62 
and 56.66, respectively, while in 2000 they are 7.43 and 20.56. Based upon 
the model, whether two nations cooperate in conflicts is not indicative of 
the level of trade between them, except for the notable case of 1986, where 
bilateral trade is positively correlated with military cooperation. 
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Fig. 2. 95% credible intervals of the covariate coefficients over time for the gravity model. 



We now examine the posterior distributions of the country-specific sender 
and receiver random effects. These effects describe the average deviations 
of a country's export and import levels from those that would be predicted 
by the regression model alone. In Figure 3 the colored dots are a random 
sample of 150 values from the bivariate posterior distribution of the sending 
and receiving effects for each country, and the country labels are located 
at the posterior means. Countries that are close to each other, based on 
their posterior mean, are similar in color. As might be expected, we see in 
each plot that there exists a strong positive relationship between exporting 
(sending) and importing (receiving) and that the relative positions of the 
nations change only slightly over the four years shown in the figure. This 
strong positive relationship suggests a possible model simplification for these 
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data in which the sender and receiver effects are co-linear, although such 
a model reduction may not be appropriate for other data sets. 

A closer examination of the plots reveals that the United States (USA), 
Germany (DEU), Japan (JPN) and the United Kingdom (GBR) are located 
at the top right corner for most of these plots, and thus are considered some 
of the most active nations in the network, even after accounting for their co- 
variate information. On the other hand, nations such as Nepal (NPL), Oman 
(OMN), Barbados (BRB) and Mauritius (MUS) are among the least active, 
based on their location in the plots. Over the period, the rise of East Asian 
countries through trade is exemplified by the movement of Singapore (SGP) 
on the receiving axis — the 95% credible interval of Singapore's receiving po- 
sition in 2000 minus its receiving position in 1981 is (1.449, 3.967). Finally, 
note the dip in imports to Argentina (ARG) in 1991 and Egypt (EGY) in 



16 



A. H. WESTVELD AND P. D. HOFF 



Table 3 

E(0) sr and E(0) ss parameter estimates for the gravity 
model 

Parameter Markov chain 2.5% Median 97.5% 




6.777 9.841 14.733 

2.429 3.665 5.591 

0.597 0.705 0.790 

2.129 2.787 3.829 

10.089 10.292 10.496 

3.113 3.327 3.523 

0.307 0.323 0.338 



2000. In each situation, the value of imports from all countries in the data 
is zero. It is unlikely that there were no imports for either country for those 
years. A plausible explanation for the imports to Argentina being "zeroed- 
out" might be due to a currency reform that the country undertook in 1991. 
As for the Egyptian case, around the year 2000 there was not a period of 
financial instability, suggesting that the zero imports are an aberration in 
the data. We note that, by allowing for time and country-specific importer 
and exporter effects, our estimates of the regression coefficients will be fairly 
robust to such outliers. 

The assumption of a stationary covariance structure allows us to inter- 
pret the the marginal covariances S(0) sr and S(0) 99 as across-year average 
covariances. Using the posterior samples from $ sr .,T sr , § gg and T gg , the em- 
pirical posterior distributions for S(0) sr and T,(0) gg can be computed. The 
results are in Table 3, which presents the trace plots of the Markov chains 
along with the 95% credible intervals and posterior medians. Notice that the 
medians of the posterior distributions for a 2 s and coincide with the spread 
of the posteriors of the sender-receiver estimates for the nations (Figure 3). 
Also from the table we see that: (1) the median posterior correlation p sr 
between the sending and receiving effects is 0.705, and (2) the median pos- 
terior residual correlation p gg within a pair of nations is 0.323. The latter 
suggests a modest degree of reciprocity among pairs of actors in the network 
at a given point in time. 

We also examine the auto-regressive coefficients to see what effect the 
previous year has on exports, imports and reciprocity for the current year. 
From Table 4, the medians of the posterior distributions of (j) s and (j) sr are 
0.997 and 0.005, respectively. This suggests that the level of exports this 
year is highly dependent on level of exports from the previous year but 
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Table 4 

$ sr and <& gg parameter estimates for the gravity model 
Parameter Markov chain 2.5% Median 97.5% 



0.991 
-0.010 
0.121 
0.505 
0.665 
0.100 



0.997 
0.005 
0.161 
0.572 
0.670 
0.106 



1.002 
0.019 
0.201 
0.632 
0.676 
0.111 



perhaps not dependent on imports from the previous year. Comparatively, 
the medians of the posterior distribution for <p r and <p rs are 0.572 and 0.161, 
respectively. That is, the level of imports this year is fairly dependent on 
imports from the previous year and somewhat dependent on exports from 
the previous year, indicating a possible effect of increased purchasing power 
after a year of high exports. Since the median of the posterior distribution 
of (j) gg is 0.106, we see that a relatively small amount of positive reciprocity in 
a given year can be explained by the level of reciprocity in the previous year. 

5.2. Out- of- sample 'prediction. In order to investigate the possibility that 
we are overfitting the data, we randomly deleted 25% of the responses, 
amounting to 16,120 cases, and compared the out-of-sample predictions for 
the LSR model with covariates (Ml) against four submodels (M2-M5). The 
first submodel (M2) used the LSR structure but did not use any covariate 
information (yijj = IH + &i,t + r j,t + 9i,j,t)- The rest of the submodels con- 
sidered (M3-M5) used covariate information along with either only network 
dependence, only temporal dependence, or neither dependence structure: 

(M3) Social Relations Model: 

hiTrade^t = x'i,j,tPt + s i>t + r iit + g iij)t , 

(si,t, ri,t)' ~ mvn 
(9i,j,t, 9j,i,t)' ~ mvn 

(M4) AR(1) Model: 

lnTrade ijj4 = x- jt /3 t + c? ij)t , 

9i,j,t = <i>g9i,j,t-l + £i,j,f, £i,j,t ~ normal(0, 7" 
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Table 5 

Mean- squared- errors for LSR and submodels 



Model 


Temporal dep. 


Network dep. 


MSE 


(Ml) LSR Cov 


yes 


yes 


4.665 


(M2) LSR mean 


yes 


yes 


4.681 


(M4) AR(1) 


yes 


no 


5.554 


(M3) Social relations 


no 


yes 


9.932 


(M5) Standard regression 


no 


no 


14.101 



(M5) Standard Regression Model: 

lnTradei Jit = x\^^ t + £i,j,t 4 , £i,j,t ~ normal(0, 7 2 ). 

For each of the five models, we used the median of the posterior of the 
missing values as our predictor and compared the overall predictions using 
the mean squared error score. Table 5 presents these scores for the LSR 
model with covariates and the four submodels. From the ranking, the LSR 
model with covariates has the best performance, suggesting that we may not 
be overfitting the data. Interestingly, the next best model is the LSR mean 
model and is just slightly worse than Ml, suggesting that the covariates 
add little to the predictive performance after the network and temporal de- 
pendence structures are taken into account. The fact that the AR(1) model 
is next and performs better than the Social Relations model suggests that 
there are strong temporal dependencies in the data and these dependencies 
may be more critical than capturing the second order network dependencies. 
As might be expected, the standard regression model performs substantially 
worse than the others. Finally, it is interesting to examine the estimates of 
the /3's for the standard regression model against those of the LSR model, 
which accounts for the temporal and network dependence inherent in the 
data. The results are shown graphically in Figure 4. The figure illustrates 
two main points: (1) Even though the model for the expected value, un- 
conditional on the random effects, is the same (x' i j t ^t), there is a definite 
difference in the estimated values of the coefficients; (2) The 95% credible 
intervals for standard regression are generally shorter than those for the 
LSR model. However, the length of the intervals of the /3's for the polity 
of the exporter, cooperation in conflict and polity interaction are actually 
shorter for the LSR model compared to those of the standard regression 
model. These results illustrate that accounting for dependency in data typ- 
ically increases the nominal precision of the estimated coefficients, but this 
is not always the case, and depends on the distribution of the covariates 
themselves. 
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Fig. 4. Posterior 95% credible intervals for the LSR (blue) and standard regression model 
(green). 



6. Militarized interstate disputes. Jones, Bremer and Singer (1996) de- 
fined the term militarized interstate dispute (MID) as an event "in which 
the threat, display or use of military force short of war by one member state 
is explicitly directed toward the government, official representatives, official 
forces, property, or territory of another state." In this analysis, we will in- 
vestigate the patterns of MIDs in the Middle East and United States from 
1991 to 2000. 6 For this data analysis, y% j t is the binary indicator of a MID 



6 A list of countries used in this analysis (including their three-letter ISO code) can be 
found in the Appendix. Additionally, the data and some of the R code used to fit the 
model are available as supplementary material [Westveld and Hoff (2010)]. 
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initiated by country i with target j in year t. We are interested in relat- 
ing the response to the following covariates: (1) the ordinal level of alliance 
between i and j, ranging from (no alliance) to 3 (will defend each other 
militarily), (2) the real value of the log trade from % to j, (3) the real value of 
the log trade from j to i, (4) the number of inter-governmental associations 
of which both nations are members, and (5) the log distance between the 
two nations. Note that all of the covariates, except distance, are potentially 
changing over time. 

As discussed in Section 4.2, for the probit mixed effects model the variance 
of gij t is set to one, leading to additional restrictions on the priors for (j} g , 
<f> gg and pgg. Specifically, we considered the following set of diffuse priors: 

/3~mvn(0,100 x I), 

(4> s ,(t>sr, 4>rs,4>r)' ~ mvn(0, 100 X I)I($ 8r € S), 

(<f>gi<f>g g y ~ mvn(0, 100 x T)I($ gg £ S)I(T gg is positive definite), 

T sr ~ inverse-Wishart(4,I), 

Pgg ~ normal(0, 100)I(— 1 < p gg < l)I(T gg is positive definite). 

The posterior distribution for these parameters was approximated with 
a Markov chain Monte Carlo algorithm consisting of 7 million scans. The 
first two million of these scans were dropped to allow for convergence to the 
stationary distribution. Parameter values were saved every 1,000th scan, re- 
sulting in 5,000 samples for each parameter with which to approximate the 
posterior distribution. 

6.1. Results. Figure 5 presents the 95% credible intervals for the coeffi- 
cients of the covariates. We focus attention on the intervals not containing 
zero (with high credibility), suggesting an effect on MIDs. Overall, the pat- 
tern of the intervals for the level of alliance between a pair of nations appears 
mixed. As might be expected, for four of the years (1993, 1996, 1997, 1999) 
the medians are below zero, suggesting a negative impact on MIDs for higher 
levels of alliance — in 1999, the empirical probability that the coefficient is 
below zero is 79%. However, in 1991 and 1994, it appears that the higher 
the level of alliance between two nations, the more likely that they would 
have a MID. A possible reason for this paradox is that Oman has the only 
level 3 alliances in the data and in 1991 it had disputes with both Iraq and 
Jordan, and in 1994 it had a dispute with Iraq. The effect of the number 
of inter-governmental organizations to which a pair of nations belong also 
appears to be minimal over the period, except for the year 1993. The effects 
of the log of exports from the initiator to the target and the log of imports 
from the target to the initiator are very interesting. Over the period, there 
appears to be a slight trend for the coefficients of both of the covariates (with 
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Fig. 5. 95% posterior credible intervals for the covariate coefficients. 



extremely large variability in the final year). 7 These trends suggest that the 
more a potential initiator of a dispute exports to a particular nation, the 
less likely it is for a dispute to occur. This is in contrast to importing from 
a particular country. Finally, distance appears to be a deterrent to conflict; 
the farther a pair of nations are from each other, the smaller the chance of 
a militarized dispute between the pair. 

Figure 6 presents 200 random samples from the bivariate posterior dis- 
tribution of the sender and receiver effects for each country. These effects 
represent deviations of the country-specific rates of initiating and receiving 
MIDs from what would be predicted by a probit regression model alone. 



7 We also fit the model without the year 2000 and found the precision of the /3's for 
1991-1999 to be similar to those in Figure 5. 
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Fig. 6. Sender-receiver effects for the model with covariates. 



From the figure, we see that there are some nations for which the distribu- 
tions do not overlap, suggesting differences between the nations with respect 
to their sending and receiving effects. There appears to be a positive correla- 
tion between the sending and receiving of militarized disputes — the median 
correlation turns out to be 0.563 (Table 6). As the United States (USA) is 
near the far right corner for all the plots in the figure, it is the most active 
in the network over the period. This suggests that the United States has far 
more disputes than would be expected, given just its covariate information. 
In particular, since distance is generally a significant deterrent to disputes, 
the United States has far more disputes than would be expected, based on 
its distance from the Middle East. Note that in 1991, Iraq (IRQ) and Jordan 
(JOR) are also high initiators of disputes. However, Oman (OMN), Lebanon 
(LBN) and Cyprus (CYP) neither initiate nor are the target of many dis- 



LONGITUDINAL RELATIONAL DATA 



23 



Table 6 

E(0) sr and E(0) gg parameter estimates for the MIDs model 
Parameter Markov chain 2.5% Median 97.5% 



0" sor 

psr 

„2 
(J 

Pgg 



1.632 


5.412 


21.088 


0.314 


2.831 


14.373 


0.100 


0.563 


0.846 


1.765 
1 


5.491 
1 


23.056 
1 


0.187 


0.683 


0.955 



putes over the period. In contrast, the results can be compared to those in 
Figure 7, where the analysis was conducted without the covariates; that is, 
only a mean was fit at each point in time (yij t t = [M + Si,t + Tj,t + 9i,j,t)- 
Now the United States is no longer in the upper right-hand corner of the 
plots. However, Cyprus is still in the lower left-hand corner of all the plots. 
In this case, accounting for influential covariate information induces greater 
variability in the sender-receiver random effects. 

Tables 6 and 7 describe the variability of the sender and receiver effects 
and the temporal variation. The median of p gg is 0.683, and while the 95% 
credible interval is quite spread out, its range is completely above zero, sug- 
gesting a certain amount of positive reciprocity in the network at a given 
point in time. However, since the median of the posterior distribution of (p gg 
is 0.189 (and this interval partially contains zero), we see that positive reci- 
procity in a given year may not be readily explained by the level of reci- 
procity in the previous year. Since the median of the posterior distributions 
for (f) s and (f) sr are 0.761 and 0.245, respectively, we see that the initiation of 
disputes by a particular nation depends to a large degree on whether they 
initiated disputes in the previous year and, to a lesser extent, on whether 
they were a target in the previous year. Finally, the median of the posterior 
distributions for <j) r and cp rs are 0.909 and —0.029, respectively. This sug- 
gests that whether a nation is a target this year depends heavily on whether 
they were a target in the previous year, but depends very little on whether 
they initiated disputes in the previous year. 

7. Discussion. This paper has developed a framework that incorporates 
temporal dependence within the domain of social relations regression mod- 
els. We showed that our particular mixed effects model can account for both 
second order network dependence and temporal dependence. By placing the 
temporal dependence on the random effects representing the network depen- 
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Fig. 7. Sender-receiver effects for the mean model. 



dence, the network is allowed to evolve over time. Additionally, a generalized 
linear modeling framework was developed and a general Bayesian estima- 
tion approach was outlined. Specific examples for Gaussian and binary re- 
sponses were illustrated and applied to the study of international trade and 
militarized interstate disputes, respectively. The incorporation of temporal 
dependence allowed for insight into the network of international trade by 
noting that after accounting for covariate information, the level of exports 
in a given year is highly dependent on the level from the previous year, but 
not dependent on the level of imports. Conversely, the level of imports in 
a particular year is fairly dependent on imports from the previous year and 
only somewhat dependent on exports the previous year. Additionally, only 
a slight degree of reciprocity can be explained by the level of reciprocity in 
the previous year. 
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Table 7 

$ sr and & gg parameter estimates for the MIDs model 



Parameter 


Markov chain 2.5% 


Median 


97.5% 


<f> s 




0.560 


0.761 


0.902 


<t>sr 




0.067 


0.245 


0.468 


(bra 




-0.207 


-0.029 


0.155 


(b r 




0.679 


0.909 


1.081 


<Pg 




0.003 


0.741 


0.947 


<t>99 


UNHMi 


-0.034 


0.189 


0.920 



The authors plan to extend the research by (1) considering other ap- 
proaches for modeling the temporal dependence, both stationary and non- 
stationary, and (2) allowing for third-order dependencies, such as those out- 
lined in Hoff, Raftery and Handcock (2002) and Hoff (2005), to be dependent 
over time. 

APPENDIX A: MCMC ALGORITHM FOR THE GAUSSIAN CASE 

Parameter estimation is conducted through the construction of a Markov 
chain in the parameters O. The following MCMC algorithm presents one 
possible construction: 

1. Sample f3 from its mvn(M, V) full conditional distribution, where 

/A-l A \ - 1 

v = £ £ "fas^M+V 1 ' 

\ j=l j=i + l / 

B [i,j] =V[i,j] - sri-rsj. 

2. Sample each srj, i 6 1, . . . , A, from its mvn(M, V) full conditional distri- 
bution, where 

V=((A-l)Z- g 1 + X- 1 )- 1 , 



M = V (^ £ %A 

B M = V[ij] ~ x [ij]P ~ rs r 
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3. Sample (</>*, (jf sr , 4>* s , 4>*)' from a mvn(M, V) distribution, where 

/AT \ -1 

\i=i t=i j 



M = v ( E E 4t-i r -^M + . 

Vi=i t=i / 



sr 



i,t-l u 
Sr Lt-l 



(a) Calculate S* r from §* r and T sr and compute the Metropolis-Hastings 
ratio: 

^ ntidmvn^lO;^,^,,^)) 
~[\f =1 dmvn(sri\0;Y, sr ($ sr ,r sr )) 

duwn(^* sr \M^ ar ;V^ sr ) dmvn(^ sr .|M; V) 
X dmvn($ sr |M$ sr ;FcO X dmvn($* r |M; V) ' 

(b) Accept $* r with probability r A 1. 

9>^W 

/A-l AT \ - 1 

^ = E E E Z '[i,j],t-l T 99 Z [i,j],t-l + 

\ i=l j=l+l t=l / 

/A-l A T 



4. Sample {(j) q ,(t) qq )' from a mvn(M, V) distribution, where 



m =mE E E^i r »W+^ 

\ i=l j=i+l i=l 

ff[i,j],t = V[i,j],t - V[i,j],t - sr i>t - rs jtt , 



z M,t-i 



9i,j,t-l 9j,i,t-l 
9j,i,t-l 9i,j,t-l 



(a) Calculate £* 5 from $* g and T gg and compute the Metropolis-Hastings 
ratio: 

_ Wi=l Uf=i+l dmvn (9[«,i] 1°; ^Ig^gg^aa)) 



Y\t=l Uf=i+1 dmvn (ff[ij] |0; Zgg($gg, Tgg)) 

x dmvn($; fl |M^ 8fl ;Vb g8 ) ^ dmvn($ gg |M; 
X dmvn($ ra |M* 99 ;V$ 99 ) X dmvn($* 9 |M; V) 

(b) Accept $* 5 with probability r A 1. 
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5. Sample r* r from an inverse- Wishart (.AT + v sr , {SS sr + S sr } 1 ) distribu- 
tion, where 

A T 

SS sr = ^ ^Z( sr i,t ~ ^srsr it t-i)(sr it t - $srSri,i 



,t-l) • 



i=l t=l 



(a) Calculate S* r from <3> sr and T* r and compute the Metropolis-Hastings 
ratio: 

^ ntidmvn^lOjE^^,^)) 
n i=1 dmvn(sri|0;S sr .($ sr ,r sr )) 
inverse-Wishart(r* r ,|f sr ; Sj 1 ) 

X i 

inverse-Wishart(r sr |u sr ; 5^. ) 

inverse-Wishart(r sr | AT + v sr ; {SS sr + Ssr} -1 ) 
inverse-Wishart(r* r |AT + v sr ', {SS sr + S sr } -1 ) 

(b) Accept r* r with probability r A 1. 
6. Sample a proposal for T* g as follows: 

[al* | •] ~ inverse-gamma ^a a = N/2 + a a ,Sl= y ^ a m^ / 2 + <^ , 

[°b* I '] ~ inverse-gamma ^aj, = iV/2 + a&, 5^ = ( ^2 tfn\ 

where N = A(A — 1)(T — 1), and set 7^* = (<?£* + of )/4 and A; 9 = (of - 

(a) Calculate £* 9 from $ 99 and T* g and compute the Metropolis-Hastings 
ratio: 

_Ui=i nj=i+idmvn( 5[ij1 |0; 



2 + S b \, 



Ut=i Uf=i+i dmvn( 5[jJ] |0; Z gg ($ gg ,T gg )) 
inverse-gamma^* \a a \ 5 a ) 



inverse-; 


gamma (0% 




5a) 


inverse-^ 


»amma((Tj* 


\atb' 


,s b ) 


inverse-; 


gamma (cr^ 




s b ) 


inverse-; 


gamma(o"^ 




St) 


inverse-^ 


;amma((j^ 


\ai;5l) 


inverse-; 


gamma(<7^ 




4) 



inverse-gamma(o"^* <5jj 
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(b) Accept T* g with probability r A 1. 
7. Sample the missing data yijt from its mvn(Mt,Vt) full conditional dis- 
tribution, where 
(a) t = l: 

= + sr i,i + rs j,i, 

C[i,j],2 = 9[i,j],2 + *gs^[ij],l) 

Mi = viC^r- 1 ^^]^ + s flfl (o)-V[ij],i). 



(b) Kt<T: 



(c) t = T: 



M[i,j],t = V[i,j],t + sr i,t + rs j:U 
C[i,j],t+i = 9[i,j],t+l + ^gg^[i,j],t, 
D [i,j],t-1 = V[i,j],t + $gg9[i,j],t-l, 

Vt = ^' 99 r; 9 1 %g+T- g Y\ 

V[i,j],T = V[i,j],T + SVi,T + rs j,Ti 

D[i,j],T-i = M[ij],T + *ra^[ij],T-i, 

M T = y T (r- 1 % i]>T _i). 



APPENDIX B: MCMC ALGORITHM FOR THE PROBIT MODEL 

In order to augment the previous algorithm for the probit LSR model, the 
Gibbs sampling procedure for each and pair at the following times 
t = 1, . . . , T proceeds by sampling the conditional distribution for each Oij,t, 
based on a truncated normal distribution; the truncation is to the left of 
zero if yijj — an d to the right of zero if yij,t = L 

[0 [iJitt \-]~mvn(M t ,Vt), 

rnormal(M;,y t *)I(^ i)t < 0)1(^ = 0), 
[Oij,m,i,t> 'I ~ \ normal(M t *,V t *)m,j,t > 0)%ij, t = 1), 

f normal(M t *, < 0)I(y i)M = 0), 

L^l^t, "J { novmal(M t *,V t *)m,i,t > 0)%; lM = 1). 
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The means and variances of [^[jjl.tH for t = l,l <t <T, t = T have the same 
expressions as those for y%jt hi Step 7 of Appendix A. For p gg we simply 
suggest using a Metropolis-Hastings update using an uniform proposal dis- 
tribution around the current value. The range of this distribution is the only 
tuning parameter in the Markov chain Monte Carlo algorithm. 

APPENDIX C: SET OF NATIONS IN THE TRADE APPLICATION 

Algeria (DZA), Argentina (ARG), Australia (AUS), Austria (AUT), Bar- 
bados (BRB), Belgium (BEL), Bolivia (BOL), Brazil (BRA), Canada (CAN), 
Chile (CHL), Colombia (COL), Costa Rica (CRI), Cyprus (CYP), Denmark 
(DNK), Ecuador (ECU), Egypt (EGY), El Salvador (SLV), Finland (FIN), 
France (FRA), Germany (DEU), Greece (GRC), Guatemala (GTM), Hon- 
duras (HND), Iceland (ISL), India (IND), Indonesia (IDN), Ireland (IRL), 
Israel (ISR), Italy (ITA), Jamaica (JAM), Japan (JPN), Malaysia (MYS), 
Mauritius (MUS), Mexico (MEX), Morocco (MAR), Nepal (NPL), Nether- 
lands (NLD), New Zealand (NZL), Norway (NOR), Oman (OMN), Panama 
(PAN), Paraguay (PRY), Peru (PER), Philippines (PHL), Portugal (PRT), 
Republic of Korea (KOR), Singapore (SGP), Spain (ESP), Sweden (SWE), 
Switzerland (CHE), Thailand (THA), Trinidad and Tobago (TTO), Tunisia 
(TUN), Turkey (TUR), United Kingdom (GBR), United States (USA), 
Uruguay (URY), Venezuela (VEN). 

APPENDIX D: SET OF NATIONS IN THE MIDS APPLICATION 

Afghanistan (AFG), Bahrain (BHR), Cyprus (CYP), Egypt (EGY), Iran 
(IRN), Iraq (IRQ), Israel (ISR), Jordan (JOR), Kuwait (KWT), Lebanon 
(LBN), Oman (OMN), Qatar (QAT), Saudi Arabia (SAU), Syria (SYR), 
United Arab Emirates (ARE), United States (USA), and Yemen (YEM). 
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SUPPLEMENTARY MATERIAL 

Data and R Code for the Examples (DOI: 10. 1214/10- AOAS403SUPP; 
.zip). A zip file associated with the paper contains the data and some of the 
R code used in the examples. 
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